MIFS-ND: A mutual information-based feature selection method
نویسندگان
چکیده
Feature selection is used to choose a subset of relevant features for effective classification of data. In high dimensional data classification, the performance of a classifier often depends on the feature subset used for classification. In this paper, we introduce a greedy feature selection method using mutual information. This method combines both feature–feature mutual information and feature–class mutual information to find an optimal subset of features to minimize redundancy and to maximize relevance among features. The effectiveness of the selected feature subset is evaluated using multiple classifiers on multiple datasets. The performance of our method both in terms of classification accuracy and execution time performance, has been found significantly high for twelve real-life datasets of varied dimensionality and number of instances when compared with several competing feature selection techniques. 2014 Elsevier Ltd. All rights reserved.
منابع مشابه
Conditional Mutual Information Based Feature Selection for Classification Task
We propose a sequential forward feature selection method to find a subset of features that are most relevant to the classification task. Our approach uses novel estimation of the conditional mutual information between candidate feature and classes, given a subset of already selected features which is utilized as a classifier independent criterion for evaluation of feature subsets. The proposed ...
متن کاملConditional Mutual Information - Based Feature Selection Analyzing for Synergy and Redundancy
© 2011 ETRI Journal, Volume 33, Number 2, April 2011 Battiti’s mutual information feature selector (MIFS) and its variant algorithms are used for many classification applications. Since they ignore feature synergy, MIFS and its variants may cause a big bias when features are combined to cooperate together. Besides, MIFS and its variants estimate feature redundancy regardless of the correspondin...
متن کاملInput feature selection for classification problems
Feature selection plays an important role in classifying systems such as neural networks (NNs). We use a set of attributes which are relevant, irrelevant or redundant and from the viewpoint of managing a dataset which can be huge, reducing the number of attributes by selecting only the relevant ones is desirable. In doing so, higher performances with lower computational effort is expected. In t...
متن کاملMutual Information-Based Feature Selection for Prostate Cancer Diagnosis Using Ultrasound Images
Figure 5 – Estimated PDFs:) (x p ,) | (cancerous C x p = , and) | (benign C x p =. Abstract This paper deals with the subject of feature extraction and feature selection for prostate ultrasound image classification. Feature extraction simply means to pull out useful measures from the segmented prostate structure. Feature selection implies choosing a subset of features, which have the minimum in...
متن کاملCorrelation Feature Selection and Mutual Information Theory Based Quantitative Research on Meteorological Impact Factors of Module Temperature for Solar Photovoltaic Systems
The module temperature is the most important parameter influencing the output power of solar photovoltaic (PV) systems, aside from solar irradiance. In this paper, we focus on the interdisciplinary research that combines the correlation analysis, mutual information (MI) and heat transfer theory, which aims to figure out the correlative relations between different meteorological impact factors (...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Expert Syst. Appl.
دوره 41 شماره
صفحات -
تاریخ انتشار 2014